Conserved quantities of SU(2)-invariant interactions for correlated fermions and the 
advantages for quantum Monte Carlo simulations. 
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In the context of realistic calculations for strongly-correlated materials with d- or /-electrons the 
efficient computation of multi-orbital models is of paramount importance. Here we introduce a set 
of invariants for the SU(2)-symmetric Kanamori Hamiltonian which allows to massively speed up 
the calculation of the fermionic trace in hybridization-expansion continuous-time quantum Monte 
Carlo algorithms. As an application, we show that, exploiting this set of good quantum numbers, 
the study of the orbital-selective Mott-transition in systems with up to seven correlated orbitals 
becomes feasible. 
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The calculation of the electronic properties of materials 
with d- and /-electrons requires highly efficient numerical 
algorithms capable of treating systems of many interact- 
ing fermions. Dynamical mean field theory (DMFT) and 
its cluster as well as diagrammatic extensions have proven 
very successful in predicting one- and two-particle dy- 
namical quantities^— . Moreover, the combination with 
density functional theory makes it possible to predict a 
great number of material-specific effects^—. These theo- 
ries drastically reduce the complexity of the original lat- 
tice problem by mapping it onto an appropriate Anderson 
model (containing either one single impurity or a small 
cluster of them). This however still constitutes a highly 
non-trivial many-body problem, in particular when the 
7-H . impurity site contains more than one orbital, and thcrc- 
0^ ' fore is the bottleneck of these methods. Improvements in 
, the numerical efficiency of the algorithms for solving the 
Q\ ■ impurity model, like the one we discuss here, are of great 
, importance, since they make unexplored regions of the 
O^l ■ model phase diagrams accessible and the study of new 
materials possible. 

Recently continuous-time quantum Monte Carlo (CT- 
QMC) algorithms have been introduced^— . They rep- 
resent a breakthrough in the development of efficient 
"impurity solvers" for strongly correlated electron sys- 
tems. Already from the early stages it was clear that 
multi-orbital models for d- and /-electron systems with 
SU(2)-symmetric "Kanamori" kind of interactions [see 
Eq. ([I]) below] are very well suited to be studied with 
CT-QMC, in particular with the hybridization-expansion 
(CT-HYB )£. In CT-HYB one splits the full Hamiltonian 
of the Anderson impurity problem into an interacting 
part involving the isolated impurity only (Hi oc ), a part 
for the non-interacting bath only, and a hybridization 
between the impurity and the bath. The bath part is 
analytically integrated out and the Monte Carlo simula- 
tion consists of sampling a fermionic trace in which the 
imaginary-time evolution between and j3 = 1 /T is gov- 



erned by Hi oc and at random imaginary-time positions 
creation and annihilation operators for fermions on the 
impurity site are inserted and removed. 

The standard implementation of CT-HYB is formu- 
lated in the eigenbasis of Hi oc and the trace is evaluated 
via a number of matrix-matrix multiplications, which is 
tractable for systems with up to three orbitals. Lauchli 
and Werner— put forward a very elegant solution for 
simulations with more orbitals based on the Lanczos 
algorithm. In this so-called "Krylov implementation" 
the trace is calculated using Lanczos and fast sparsc- 
matrix/vector operations. Independently of the imple- 
mentation used, it is clear that the more one reduces the 
size of the blocks of Hi oc exploiting its good quantum 
numbers, the faster the calculations go^. Therefore, for 
the efficiency of the whole computational scheme, it is 
crucial to identify as many good quantum numbers as 
possible and to make sure that they can be efficiently 
treated by the code. In the present paper we intro- 
duce what we call the "PS" vector, a set of conserved 
quantities for the SU(2)-symmctric Kanamori Hamilto- 
nian which is very simple to implement and that leads to 
a tremendous reduction of the size of the blocks. By us- 
ing it we gain a huge speed-up for calculations with more 
than three orbitals. We exploit this speed-up to study the 
orbital selective Mott transition with SU(2)-symmetric 
interaction in systems with up to seven orbitals. 

The SU(2)-symmetric Kanamori Hamiltonian that is 
widely used for multi-orbital calculations readsi^ 
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The index a runs over the N or b orbitals of the impurity, 
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n a ,<j =d\ a d a a is the number operator counting electrons 
on orbital a with spin a. The first term describes the 
repulsion U for two electrons with opposite spin on the 
same orbital. In the second line of Eq. ([1]) one finds the 
Coulomb interaction U' for two electrons with opposite 
spin on two different orbitals and U' — J, when the spins 
of the two electrons are aligned. The choice U' = U—2J, 
coming from an exact relation between the parameters 
for the case of an isolated atom in a cubic crystal-field, 
is also typically used for realistic calculations. 

For our purposes, it is convenient to work in the occu- 
pation number basis. For iV or 5 = 5, a vector in this basis 
can be symbolically denoted as follows: 
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For reasons that will be clear very soon, we color- 
code singly-occupied orbitals in blue and empty- / doubly- 
occupied orbitals in red. The "density-density" terms 
that appear in the first two lines of Eq. (JTJ) are diago- 
nal in this basis. On the contrary, the two terms con- 
tained in the last line of Eq. (JTJ) generate off-diagonal 
matrix elements. They are called 'spin-flip" and "pair- 
hopping" , respectively, and are needed to preserve the 
SU(2) spin symmetry: The former flips the spins of two 
singly-occupied orbitals while the latter transfers a pair 
of electrons from a doubly-occupied to an empty orbital. 

In the case of many orbitals the size of the basis is 
pretty big (e.g., for five orbitals Hi oc is a 1024x1024 ma- 
trix) but, as we already mentioned, we can reduce Hi oc 
to a block diagonal form by using its good quantum num- 
bers. The most obvious conserved quantities of Hi oc are 
the total number of electrons N and the z-component of 
the total spin S z . In fact, Hi oc docs not connect states 
with different N and docs not change S z either, since 
the "spin-flip" and the "pair-hopping" terms preserve 
the ^-component of the total spin. Also the total spin 
S 2 commutes with Hi oc , an obvious consequence of the 
SU(2) symmetry. Yet, S 2 turns out not to be practical 
to implement and therefore is typically not used. The 
reason for that is the same as for another good quan- 
tum number that is not exploited in CT-HYB codes: the 
"seniority number". This was introduced by G. RacahH 
and "counts" the number of doubly occupied orbitals in 
each state. It is easy to see that this is another conserved 
quantity of Hi oc . However, using N, S z and the "senior- 
ity number" as quantum labels leads to ambiguities in 
the definition of the creation and annihilation operators. 
This can be understood by considering that both | f, |, 0) 
and 1 1, 0, 4-) belong to the same block but d\ ^ connects 
them to 1 1, t-i-i 0) and | f, t) I) which have different val- 
ues of the "seniority number" . S 2 leads to a very similar 
problem. 

Hence, N and S z are the two quantum numbers typ- 
ically used in CT-HYB codes. With this choice, the 
largest block for, e.g., N or b = 5 is 100 x 100 and this is 
still pretty big. The crucial observation that we make 
here is the following: The Kanamori Hi oc connects only 



those states in the occupation number basis that have 
exactly the same singly-occupied orbitals. 
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Looking at the sketch in Eq. ([3]) it is clear that nei- 
ther the "spin-flip" nor the "pair-hopping" process can 
turn a singly-occupied orbital into an empty or a doubly- 
occupied one. This means that the pattern of the singly- 
occupied orbitals (in other words the list of singly occu- 
pied orbitals regardless the spin orientation) is conserved 
by the Kanamori Hi oc . Therefore, even though Hi oc has 
processes among different orbitals, for each orbital a pro- 
jector onto single occupations of this orbital ("PS") com- 
mutes with Hi oc . This defines a vector of operators and 
corresponding quantum numbers 

PS = {K, t - M ) 2 } for a = 1, ...N orb . (4) 

Indeed, (n a ^ — n a .i) 2 yields if the orbital a is cither 
empty or doubly occupied and 1 if the orbital a is singly 
occupied, proving the projective property. The result- 
ing vector of quantum numbers (for which we employ 
the same symbol "PS") is a binary sequence encoding 
the information about the pattern of singly occupied or- 
bitals. The number (not the pattern) of the singly oc- 
cupied orbitals has already been previously exploited as 
good quantum number of Anderson impurity Hamiltoni- 
ans in Ref. FhH . 

Labeling blocks of Hi oc with the set of quantum num- 
bers (N, S z , PS) leads to a tremendous reduction of 
the block size, as shown in the table of Fig. [TJ as well as 
the size of the d and d^ matrices. Indeed, any creation 
and annihilation operator will always connect two blocks 
in which all quantum numbers differ and all the states 
within the block behave in the same way in this respect. 

The most natural implementation of PS defined by Eq. 
(|4]) is to introduce a single label defined, e.g., in a binary 
manner as ^ Q 2 a (n a -j- — n a ,^) 2 . In our code, this informa- 
tion is used to generate H\ oc in a block diagonal structure. 
Already at this stage we can see the improvement gained 
by using PS compared to only N and S z as conserved 
quantities. In Fig. [1] we show the maximum and the 
mean block sizes. The advantage of PS becomes strik- 
ing for system with many orbitals: We obtain block sizes 
which are in average two orders of magnitude smaller 
for N or b = 7. We therefore expect a moderate speed up 
already for four orbitals which should dramatically in- 
crease with seven. More generally, the number of blocks 
increases exponentially with the number of orbitals, and 
the size of the blocks decreases correspondingly. 
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FIG. 1: (color online) Table: Maximum and mean block- 
sizes of the Kanamori Hamiltonian using the total number 
of electrons in the system N, the total spin momentum in 
z-direction S z (second column) and additionally PS (third 
column) as good quantum numbers, for various N or t- Figure: 
Ratio of CT-HYB runtime (speed up) we obtain for an equal 
amount of Monte Carlo steps with and without PS. Three 
independent measurement of the runtime were done for each 
point; the average is plotted and the spread of the results is 
of the order of the symbols. We first consider the lowest-lying 
multiplet of Hi oc as outer state for the trace (good approxima- 
tion at very low temperatures) and then include progressively 
more and more of the lowest multiplets. For N or b = 7, the 
lowest four multiplets cover an energy range of about 4.5 eV, 
i.e. a range of order U. 



To demonstrate how much the use of PS speeds up 
actual calculations, we performed single shot simulations 
on an Anderson impurity model. This consists of N or b 
semi-circular bands of half-bandwidth D = 2 eV. The in- 
teraction parameters of the Kanamori Hamiltonian were 
set to intermediate strength values, namely U = D and 
J = Q.25U. The inverse temperature /3 was set to 100 
eV -1 and the chemical potential was set to the half- filling 
condition [ihf = {N or b — \ )U — (N or b — 1)| J. With this 
model system we performed calculations for N or b varying 
from 1 to 7 with and without the use of PS for other- 
wise identical parameters as single core jobs on an AMD 
machine. 

As an additional parameter we varied the number of 
outer eigenstates of Hi oc over which the fcrmionic trace 
is computed. This is a very convenient and clean way 
of introducing a "truncation" parameter in the "Krylov" 
algorithm. It can be understood as follows: At T = one 
can restrict the computation of the trace to the lowest- 
lying multiplet only. For finite T the calculation is in- 
stead exact only upon performing the outer sum over all 
states of H[ oc , but we observe - similarly to Ref. [ll| 

that the calculation converges rapidly upon including 
more and more of the lowest-lying multiplets of Hi oc . 

In Fig. Q]we show the ratio of the QMC runtime with 
and without PS. This demonstrates that, as expected, 



the advantage of using PS is huge for calculations with 
large number of orbitals. In the Figure, the average of 
three independent timings for each value of N or b are plot- 
ted. If we look at the curves in which the initial and final 
states for the trace are not restricted to the lowest-lying 
multiplet (i.e. the typical situation for calculations at 
room temperature) we obtain a performance gain of one 
order of magnitude for five and a really remarkable gain 
of two orders of magnitude for seven orbitals. This makes 
self-consistent DMFT calculations for such systems really 
accessible. It also enables us to check the convergence 
of DMFT calculations for multi-orbital systems with re- 
spect to the number of multiplets as outer states for the 
trace, which was previously not always possible since the 
simulation was too costly. In addition, this also allows 
us to explore parameter regions which were formerly pro- 
hibitively expensive. 

To demonstrate the practical advantages of using PS 
we apply our implementation of the CT-HYB to a three-, 
five- and seven-orbital model system, as the one sketched 
in the bottom-left corner of Fig. [2 This model is ideal 
for studying the interaction-driven orbital selective Mott 
transition (OSMT), as shown in Ref. It differs from 
the more commonly used model with bands of differ- 
ent widths, since it consists of one central orbital as- 
sociated to a symmetric band and one, two or three or- 
bitals shifted up in energy by A = 0.7£>, where D is half 
the bandwidth, and an equal number of orbitals shifted 
to lower energies by the same A. All calculations were 
performed with the interaction ([T]), at half- filling, with 
J = 0.25(7 and (3D = 100. The DMFT self-consistency 
was reached considering only the lowest-lying multiplet 
as outer states in the trace. The stability of the solution 
versus the inclusion of more multiplets was afterwards 
checked. 

Hitherto, model studies of the orbital selective Mott 
transition with DMFT have focused almost exclusively 
on N or i, < 3. Here we want to test the robustness of 
the OSMT against the number of orbitals. For that we 
compare the cases of N or b = 3, 5 and 7. The only two 
calculations with five orbitals we are aware of are the 
ones of Refs. [l6| and of Ref. |Tl|. Both were done for 
somewhat different models than the one considered here 
but, more importantly, the former was carried out with 
a simplified slave-spin mean-field solver while the latter 
addressed the filling-driven OSMT only. 

Our findings are summarized by the data shown in 
Fig. [2] For N or b = 3 we reproduce the transition values 
reported in Ref. [TH and we find the existence of a simi- 
lar, though somewhat smaller, orbital selective region for 
N or b = 5 and 7. We can therefore conclude that in a 
model with SU(2)-invariant interaction characterized by 
one symmetric band and four or six other ones symmetri- 
cally shifted in energy an orbital selective region exists in 
which the central band gets insulating (its spectral weight 
at the Fermi level A(0) vanishes), while the shifted bands 
stay metallic (finite ^4(0)). 

In Fig. [5] one can see that the critical U dividing the 
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FIG. 2: (color online) Spectral weight at the Fermi level ^4(0) 
for the model sketched in the inset with one symmetric band 
(blue) and the remaining ones symmetrically shifted above 
and below of A = 0.7D. This model as been solved for N or b = 3 
in Ref. [l5|. We clearly observe an orbital selective region, with 
the central band Mott insulating (zero spectral weight) and 
the shifted bands still metallic (nonzero spectral weight) in a 
finite {/-interval up to N or t = 7. 

metallic and the orbital selective regions decreases with 
N or i,. This is a consequence of the effect of the sizable 
value for the Hund coupling J used. For J = 0we would 
have observed the opposite because more orbitals lead to 
a larger mobility and therefore a larger critical value for 
the transition to the insulating state. This delocalizing 
effect is counteracted by the presence of a large Hund 
coupling which strongly suppresses orbital fluctuations 



increasing the insulating region. This is in agreement 
with what was reported in Refs. [T5IIT7IIT8I 

In conclusion, we propose invariants for SU(2)- 
symmctric Kanamori Hamiltonians, i.e. the single occu- 
pation of each orbital. We introduce a related quantum 
label leading to very small blocks of the matrices. This 
results in a speed-up of CT-HYB quantum Monte Carlo 
calculations of up to two orders of magnitude, and allows 
us to study much more comfortably models with a large 
number of orbitals. As an example we have considered 
the interaction-driven orbital selective Mott transition at 
half-filling and found that it persists up to seven orbitals. 
In addition to the class of problems for which PS is 
useful, there are cases in which more complete schema- 
tization of the full Coulomb repulsion are needed. In 
particular the richer multiplet structure of "Slater" -type 
of parametrizations of the Coulomb interaction can play 
a role in some realistic DMFT calculations with five or 
more orbitals. In order to flexibly study such very com- 
plex Hamiltonians with CT-HYB, good quantum num- 
bers as effective as PS would be immensely helpful. 
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